IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
APPLICATION FOR PATENT 



SCANNI NG SYSTEM FOR INSPECTING ANAMOLIES ON SURFACES 
Inventors: Mehrdad Nikoonahad and 
Stanley E. Stokowski 

Cross-Reference to Related Applications 

This application is a continuation-in-part 
application of parent application entitled "Optical 
Scanning System for Surface Inspection, " by Mehrdad 
Nikoonahad, Keith D. Wells, and Brian C. Leslie, Serial 
No. 08/351,664 filed December 8, 1994. This application 
is also related to the patent application entitled 
"Optical Wafer Positioning System," by Mehrdad 
Nikoonahad, Philip R. Rigg, Keith D. Wells, and David S. 
Calhoun, Serial No. 08/361,131 filed December 21, 1994 
("Related Application"). Both prior applications are 
incorporated by reference herein in their entirety. 

Background of the Invention 

This invention relates in general to surface 
inspection systems, and in particular, to a high speed 
scanner system for inspecting anamolies on surfaces such 
as semiconductor wafers, photomasks, reticles, ceramic 
tiles, and other surfaces. 

The size of semiconductor devices fabricated 
on silicon wafers has been continually reduced. At the 
time this application is filed, for example, 
semiconductor devices can be fabricated at a resolution 
of a half micron or less and sixty-four (64) megabyte 
DRAMs are being fabricated with 0.35 micron design rule. 
The shrinking of semiconductor devices to smaller and 
smaller sizes has imposed a much more stringent 
requirement for sensitivity of wafer inspection 
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instruments which are called upon to detect contaminant 
particles and pattern defects that are small compared to 
the size of the semiconductor devices. On the other 
hand, it is desirable for wafer inspection systems to 
5 provide an adequate throughput so that these systems can 
be used in production runs to detect defective wafers. 

In U.S. Patent No. 4,898,471 to Stonestrom et 
al. assigned to the present assignee of this 
application, the area illuminated on a wafer surface by 

10 a scanning beam is an ellipse which moves in the scan 
direction. in one example given by Stonestrom et al., 
the ellipse has a width of 20 microns and a length of 
115 microns. Light scattered by anamolies or patterns 
in such illuminated area is detected by photodetectors 

15 placed at azimuthal angles in the range of 80 to 100° . 
The signals detected by the photodetectors are used to 
construct templates. When the elliptical spot is moved 
in the scan direction to a neighboring position, 
scattered light from structures within the spot is again 

20 detected and the photodetector signal is then compared 
to the template to ascertain the presence of contaminant 
particles or pattern defects as opposed to regular 
pattern. In Stonestrom et al., the scanning beam scans 
across the entire wafer to illuminate and inspect a long 

25 narrow strip of the wafer extending across the entire 
dimension of the wafer in the scanning direction. The 
wafer is then moved by a mechanical stage in a direction 
perpendicular to the scanning direction for scanning a 
neighboring elongated strip. This operation is then 

30 repeated until the entire wafer is covered. 

While the system of Stonestrom et al. performs 
well for inspecting wafers having semiconductor devices 
that are fabricated with coarser resolution, with the 
continual shrinking of the size of the devices 

35 fabricated, it is now desirable to provide an improved 



inspection tool that can be used to detect very small 
size anamolies that may be difficult to detect using 
Stonestrom et al.'s system. 

Summary of the Invention 
5 This invention is based on the recognition 

that very small anamolies can be detected by reducing 
the size of the area that is illuminated by the scanning 
light beam. Light scattered from structures in the spot 
will include background, such as light scattered by 

10 pattern on the surface, as well as light that is 
scattered by anomalies such as contaminant particles, 
pattern defects or imperfections of the surface. Such 
background can have a significant amplitude. For this 
reason, if the anamoly is of a size which is small 

15 compared to the size of the illuminated area, the 
scattered light from such anamoly may be overwhelmed by 
and become undetectable from the background. By 
reducing the size of the illuminated area or spot size, 
the ratio of the light intensity scattered by ah anomaly 

20 to that of the background will be increased, thereby 
increasing detection sensitivity. However, with a 
smaller spot size, it will be more difficult to maintain 
the uniformity of the spot along a long straight scan 
line across the entire wafer. By breaking up the scan 

25 path into short segments, it is possible to employ a 
smaller spot size while at the same time maintaining 
uniformity of the spot along the path. From the system 
point of view, by reducing the length of the scan, the 
size of the collection optics for detecting forward 

3 0 scattered light becomes more manageable. 

Thus one aspect of the invention is directed 
towards a method for detecting anamolies on a surface, 
comprising the steps of directing a beam of light at a 
grazing angle towards the surface, causing relative 
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motion between the beam and the surface so that the beam 
scans a scan path covering substantially the entire 
surface; and collecting light scattered along said path 
for detecting anamolies. The scan path includes a 
5 plurality of arrays of scan path segments, wherein each 
of at least some of such scan path segments has a span 
shorter than the dimensions of the surface. 

As used in this application, "minimum width 1 ' 
of the illuminated area or spot on the surface to be 

10 inspected is defined as the minimum dimension of a 
boundary around the area or spot along any direction on 
the surface, where the boundary is defined as the 
locations on the surface where the illumination light 
intensity is a predetermined fraction or percentage of 

15 the maximum intensity of illumination in the area or 
spot. In the description of the preferred embodiment, 
for example, the boundary is where the light 
illumination intensity is 1/e 2 of the maximum intensity 
of illumination in the area or spot, e being the natural 

20 number. The minimum dimension is the minimum distance 
between two parallel lines that enclose between them the 
boundary of the area or spot. The term "minimum width" 
is explained in more detail below. 

Another consideration of the invention is to 

25 provide an adequate throughput while data is collected 
at a moderate rate for defect detection so that the data 
collection and processing system employed need not be 
overly complex and expensive. 

Thus another aspect of the invention is 

3 0 directed towards a method for detecting anamolies on the 
surface of a semiconductor wafer, comprising directing 
a beam of light towards a surface to illuminate an area 
of the surface defining a spot having a spot size whose 
minimum width is in the range of about 5 to 15 microns, 

3 5 causing relative motion between the beam and the wafer 
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so that the beam scans a path covering the entire 
surface; and collecting light scattered along said path 
for detecting anamolies. The spot size and the 
directing and causing steps are such that the beam 
5 scanning substantially inspects the entire surface of 
the wafer at a throughput in excess of about 4 0 wafers 
per hour for 150 millimeter diameter wafers , at a 
throughput in excess of about 20 wafers per hour for 200 
millimeter diameter wafers , and at a throughput in 

10 excess of about 10 wafers per hour for 300 millimeter 
diameter wafers. 

Yet another aspect of the invention is 
directed towards a method for detecting anamolies on a 
surface, comprising the steps of directing a beam of 

15 light towards the surface to illuminate an area of the 
surface defining a spot having a spot size whose minimum 
width is in the range of about 5 to 15 microns, causing 
relative motion between the beam and the surface so that 
the beam scans a path covering substantially the entire 

2 0 surface; and collecting light scattered along said path 
for detecting anamolies. The spot size and said 
directing and causing steps are such that the surface is 
inspected at a speed not less than about 1.5 cm 2 /s. 

Still another aspect of the invention is 

2 5 directed towards a method for detecting anamolies on a 

surface, comprising the steps of directing a beam of 
light towards said surface to illuminate an area of the 
surface defining a spot having a spot size whose minimum 
width is in the range of about 5 to 15 microns, causing 

3 0 relative motion between the beam and the surface so that 

the beam scans a path covering substantially the entire 
surface; and collecting light scattered along said path 
for detecting anamolies. The surface has dimensions of 
not less than 200 millimeters in any direction along the 
35 surface. The directing and causing steps are such that 



the beam scans substantially the entire surface in about 
50 to 90 seconds. 

Another aspect of the invention is directed 
towards a system for detecting anamolies on a surface, 
comprising means for directing a beam of light at a 
grazing angle toward said surface; means for causing 
relative motion between the beam and the surface so that 
the beam scans a scan path covering substantially the 
entire surface; and means for collecting light scattered 
along said path for detecting anamolies. The scan path 
includes a plurality of arrays of scan path segments, 
wherein each of at least some of such scan path segments 
has a span shorter than the dimensions of the surface. 

One more aspect of the invention is directed 
towards a system for detecting anamolies on a surface of 
a semiconductor wafer , comprising means for directing a 
beam of light towards said surface to illuminate an area 
of the surface defining a spot having a spot size whose 
minimum width is in the range of about 5 to 15 microns; 
means for causing relative motion between the beam and 
the wafer so that the beam scans a path covering 
substantially the entire surface; and means for 
collecting light scattered along said path for detecting 
anamolies. The spot size and said directing and causing 
means are such that the beam -scanning substantially 
inspects the entire surface of the wafer at a throughput 
in excess of about 40 wafers per hour for 150 millimeter 
diameter wafers , at a throughput in excess of about 2 0 
wafers per hour for 200 millimeter diameter wafers, and 
at a throughput in excess of about 10 wafers per hour 
for 3 00 millimeter diameter wafers. 

Yet another aspect of the invention is 
directed towards a system for detecting anamolies on a 
surface, comprising means for directing a beam of light 
toward said surface to illuminate an area of the surface 
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defining a spot having a spot size whose minimum width 
is in the range of about 5 to 15 microns; means for 
causing relative motion between the beam and the surface 
so that the beam scans a path covering substantially the 
5 entire surface; and means for collecting light scattered 
along said path for detecting anamolies. The spot size 
and said directing and causing means are such that the 
surface is inspected at a speed not less than about 1.5 
cmVs. 

10 Still one more aspect of the invention is 

directed towards a system for detecting anamolies on a 
surface, comprising means for directing a beam of light 
toward said surface to illuminate an area of the surface 
defining a spot having a spot size whose minimum width 

15 is in the range of about 5 to 15 microns; means for 
causing relative motion between the beam and the surface 
so that the beam scans a path covering substantially the 
entire surface; and means for collecting light scattered 
along said path for detecting anamolies. The surface 

20 has dimensions of not less than 200 millimeters in any 
direction along the surface. The directing and causing 
means are such that the beam scans substantially the 
entire surface in about 50 to 90 seconds. 

Brief Description of the Drawings 
25 Fig. 1A is a schematic view of an elliptical- 

shaped illuminated area or spot on a surface to be 
inspected to illustrate the invention. 

Fig. IB is a graphical illustration of the 
illumination intensity across the width or short axis of 
3 0 the elliptical spot of Fig. 1A for defining a boundary 
of the spot and to illustrate the invention. 

Fig. 2 show partially in perspective and 
partially in block diagram form a system for inspecting 
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anamolies of a semiconductor wafer surface to illustrate 
the preferred embodiment of the invention. 

Fig. 3 is a perspective view showing in more 
detail the illumination and collection features of the 
system of Fig. 2. 

Fig. 4 is a schematic view of a small portion 
of a semiconductor wafer surface illustrating the scan 
path of an illumination spot on the surface to 
illustrate the preferred embodiment. 

Fig. 5 is a schematic view illustrating the 
illumination and collection angles of the system of Fig. 
3. 

Fig. 6 is a schematic view of three elliptical 
illuminated areas or spots to illustrate the scanning 
and data acquisition processes of this invention. 

Figs. 7A, 7B are side views illustrating two 
different polarization schemes employed by present 
invention for illuminating a surface to be inspected. 

For simplicity , identical components in the 
different figures of this invention are labeled by the 
same numerals. . 

Detailed Description of the Preferred Embodiment 

Fig. 1A is a schematic view of an elliptical- 
shaped illuminated area (or spot) of a surface inspected 
by the system of this invention to illustrate the 
invention. As explained below, the laser beam 
illuminating the surface inspected approach the surface 
at a grazing angle, so that even though the illumination 
beam has a generally circular cross-section, the area 
illuminated is elliptical in shape such as area 10 in 
Fig. 1A. As known to those skilled in the art, in light 
beams such as laser beams, the intensity of the light 
typically does not have a flat distribution and does not 
fall off abruptly to zero across the boundary of the 
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spot illuminated , such as at boundary 10a of spot 10 of 
Fig. 1A. Instead, the intensity falls off at the outer 
edge of the illuminated spot at a certain inclined 
slope, so that instead of sharp boundaries such as 
5 boundary 10a illustrated in Fig. 1A, the boundary is 
typically blurred and forms a band of decreasing 
intensity at increasing distance away from the center of 
the illuminated area. 

In many lasers, the laser beam produced has a 

10 Gaussian intensity distribution, such as that shown in 
Fig. IB. Fig. IB is a graphical illustration of the 
spatial distribution of the illumination intensity in 
the Y direction of a laser beam that is used in the 
preferred embodiment to illuminate spot 10 of a surface 

15 to be inspected as shown in Fig. 1A, and thus is also 
the illumination intensity distribution across spot 10 
in the Y direction. As shown in Fig. IB, the 
illumination intensity has been normalized so that the 
peak intensity is 1, and the illumination intensity has 

2 0 a Gaussian distribution in the Y direction as well as in 

the X direction. Points 12 and 14 are at spatial 
locations yl and y5 at which points the illumination 
intensity drops to 1/e 2 of the peak intensity, where e 
is the natural number. AS used herein to describe the 

25 preferred embodiment, the minimum width of spot 10 is 
the distance between these two points (distance between 
yl and y5) is the length of the short axis of elliptical 
illuminated area 10 and denoted as the width w in Fig. 
1A. The spot 10 is defined by the area within a 

30 boundary 10a where the illuminaton is 1/e 2 of that of 
the maximum intensity of illumination at the center of 
the spot. 

As a broader definition, "minimum width" of 
the elliptical spot 10a may be defined as the minimum 

3 5 distance between two parallel lines that enclose between 
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them the boundary of the area or spot. In reference to 
spot 10 in Fig. 1A, for example , if one were to draw two 
parallel lines enclosing the boundary 10a, such as lines 
ql, q2, the distance between lines ql, q2 is d, which is 
5 minimized when both ql, q2 touch the boundary 10a. The 
distance d is minimum for all directions of ql, q2 when 
the lines ql, q2 coincide with grid lines yl, y5, so 
that the minimum width of the spot 10 is w. Even where 
10a is not an ellipse, but is of another shape such as 

10 rectangular, square, or irregular in shape, the same 
broader definition is applicable. 

Fig. IB shows only the main lobe of the laser 
or light beam. It is known that the main lobe is also 
accompanied by sidelobes, so that areas of the surface 

15 outside of area or spot 10 would also be illuminated; 
scattering by structures of the surface of the light in 
the sidelobes and collected by the detectors causes 
noise. 

In the description above, it is indicated that 

2 0 for a spot which is relatively small compared to the 

size of the surface to be inspected, it will be 
difficult to maintain uniformity of the spot across a 
scan line which spans the entire length or width of the 
wafer. In reference to Figs. 1A, IB, variation in the 
25 minimum width (as defined above! w of the main lobe of 
the focal plane intensity distribution, and the level of 
the sidelobes is a measure of the uniformity of the spot 
10 as the beam scans across the surface. Where the 
minimum width and the sidelobes level vary little over 

3 0 the scan line, the spot is said to be uniform. In other 

words, when the spot size is relatively small compared 
to the size of the surface to be inspected, it will be 
difficult to maintain uniform width of the main lobe and 
uniform level of the sidelobes of the focal plane 
3 5 intensity distribution throughout the extent of a long 
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scan line across the entire width of the wafer. a 
variation in either one of these two parameters (width 
of main lobe and sidelobe level) leads to a variation in 
detection sensitivity along the scan direction which is 
5 undesirable. 

In view of the above problems , Applicants have 
invented a surface inspection system where the size of 
the area illuminated by the scanning light beam can be 
reduced while maintaining uniform detection sensitivity 

10 by causing the scanning light beam to scan short scan 
path segments having a spatial span less than the 
dimension of the surface it is scanning, as illustrated 
in the preferred embodiment in Figs. 2 and 4, where 
these short scan path segments are not connected 

15 together but are located so that they form arrays of 
scan path segments as illustrated in more detail below. 

The surface inspection system of this 
invention will now be described in reference to Figs. 2 
and 3. As shown in Fig. 2, system 20 includes a laser 

20 22 providing a laser beam 24. Beam 24 is expanded by 
beam expander 2 6 and the expanded beam 2 8 is deflected 
by acousto-optic deflector (A0D) 30 into a defected beam 
32. The deflected beam 32 is passed through post-AOD 
and polarization selection optics 34 and the resulting 

2 5 beam is focused by telecentric scan lens 3 6 onto a spot 

10 on surface 4 0 to be inspected, such as that of a 
semiconductor wafer, photomask or ceramic tile, 
patterned or unpatterned. 

In order to move the illuminated area that is 

3 0 focused onto surface 4 0 for scanning the entire surface, 

the AOD 30 causes the deflected beam 32 to change in 
direction, thereby causing the illuminated spot 10 on 
surface 40 to be scanned along a scan line 50. As shown 
in Fig. 2, scan line 50 is preferably a straight line 
3 5 having a length which is smaller than the dimension of 
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surface 4 0 along the same direction as the scan line. 
Even where line 50 is curved, its span is less than the 
dimension of surface 4 0 along the same general 
direction. After the illuminated spot has completed 
scanning surface 40 along scan line 50 , surface 40 of 
the wafer is moved along the X axis so that the 
illuminated area of the surface moves along arrow 52 and 
AOD 30 causes the illuminated spot to scan along a scan 
line parallel to scan line 50 and in adjacent position 
spaced apart from scan line 50 along the negative X 
axis. After the illuminated spot has covered such scan 
line, surface 40 is moved by a small distance so that 
the area of the surface to be illuminated is moved along 
direction 52 in order to scan an adjacent scan line at 
a different X position. As described below, this small 
distance preferably is equal to about one quarter of the 
height of spot 10. This process is repeated until the 
illuminated spot has covered strip 54; at this point in 
time the illuminated area is at or close to the edge 
54<a. At such point, the surface 40 is moved along the 
Y direction by about the length of scan line 50 in order 
to scan and cover an adjacent strip 56, beginning at a 
position at or close to edge 56a. The surface in strip 
5 6 is then covered by short scan lines such as 5 0 in a 
similar manner until the other end or edge 5 6b of strip 
56 is reached at which point surface 50 is again moved 
along the Y direction for scanning strip 58. This 
process is repeated prior to the scanning of strip 54, 
56, 58 and continues after the scanning of such strips 
until the entire surface 40 is scanned. Surface 40 is 
therefore scanned by scanning a plurality of arrays of 
short path segments the totality of which would cover 
substantially the entire surface 40. 

Fig. 4 is an exploded view of a portion of the 
two strips 54, 56 and smaller portions of two other 
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neighboring strips to illustrate in more detail the 
above-described scanning process. In the preferred 
embodiment as shown in Fig. 4, the optical beam 38 scans 
in only one direction as illustrated by the arrows of 
scan path segments 50 , 50*, 50*', 50 ' ' 1 . Scan path 50 
has an effective start location at 72 and spot 10 moves 
to the right therefrom until it reaches the border 5 5 
between strips 54 and 56. Upon reaching border 55, a 
stage (see Fig. 3) moves the surface 40 in the X 
direction perpendicular to the scanning direction Y and 
the spot assumes the new start position 74 and moves 
along a scan line 50' parallel to scan line 50. The 
movement of the spot 10 along scan lines 50, 50', 50 ,f , 
50 " " • and so on is achieved by means of AOD 3 0 as 
explained below. 

The deflection of beam 32 by AOD 30 is 
controlled by chirp generator 80 which generates a chirp 
signal. The chirp signal is amplified by amplifier 82 
and applied to the transducer portion of AOD 30 for 
generating sound waves to cause deflection of beam 32 in 
a manner known to those skilled in the art. For a 
detailed description of the operation of the AOD, see 
"Acoustooptic Scanners and Modulators," by Milton 
Gottlieb in Optical Scanning , ed. by Gerald F. Marshall , 
Dekker 1991 , pp. 615-685. Briefly, the sound waves 
generated by the transducer portion of AOD 30 modulates 
the optical refractive index of an acoustooptic crystal 
in a periodic fashion thereby leading to deflection of 
beam 32. Chirp generator 80 generates appropriate 
signals so that after being focused by lens 36, the 
deflection of beam 32 causes the focused beam to scan 
along a scan line such as line 5 0 in the manner 
described. 

Chirp generator 80 is controlled by timing 
electronic circuit 84 which in the preferred embodiment 
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includes a microprocessor. The microprocessor supplies 
the beginning and end frequencies fl, f2 to the chirp 
generator 80 for generating appropriate chirp signals to 
cause the deflection of beam 32 within a predetermined 
5 range of deflection angles determined by the frequencies 
fl, f2. The auto-position sensor (APS ) Optics 90 and 
APS electronics 9 2 are used to detect the level or 
height of surface 40 and form a part of the Related 
Application. Detectors such as detector 111b collects 

10 light scattered by anamolies as well as the surface and 
other structures thereon along scan line 50 and provides 
output signals to a processor in order to detect and 
analyze the characteristics of the anamolies. 

Fig, 3 is a perspective view of system 2 0 of 

15 Fig. 2 showing in more detail the arrangement of the 
collection or detection channels to illustrated the 
preferred embodiment. As shown in Fig. 3, four 
collection channels are used, two channels 110a, 110b 
for collecting scattered light that is within the 

20 respective ranges of azimuthal angles of -(75-105)° and 
(75-105)°. Two additional collection channels 111a, 
111b are also employed for detecting forward scattered 
light that is within the respective ranges of azimuthal 
angles of -(30-60)° and (30-60)°. If desired, it is of 

25 course possible to employ four, independent collection 
channels with other different solid angles of 
collection, two of said collection channels located in 
the forward direction to collect light in the forward 
direction centered substantially at ±45° azimuthally and 

30 two of the channels are located to collect light 
centered substantially at ±90° azimuthally. 

Fig. 5 is a top view of the angles of 
collection of the four detectors. As shown in Fig. 5, 
the solid angles of collection of channels 110a, 110b 

35 are labeled <J>! and those for channels 111a, 111b are 
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labeled <3> 2 . To simplify the drawing , the components 
between laser 22 and focus beam 38 are not shown in Fig. 
5. In reference to Fig. 3, system 20 also includes an 
imaging channel 121 and an alignment/registration 
5 channel 122. 

Surface 40 may be smooth (118) or patterned 
(119). The incident focus beam 38 is preferably in the 
range of about 10-85° to the normal direction 150 to the 
surface 40 and more preferably within the range of 50- 
10 8 0° from the normal; in Fig. 3 f this angle is labelled 
0. The four channels of collection are preferably at 
elevation angles OC that will collect scattered light 
from 3-30° from the plane of surface 40. 

Improved Sensitivity of Detection 

15 From the point of view of sensitivity of 

detection, it is desirable to design the illumination 
optics portion of system 20 so that the minimum width w 
of the illuminated spot 10 is minimized. The minimum 
width w is proportional to the focal length of lens 36 

2 0 and inversely proportional to the beam diameter of beam 
2 8 and 32. Therefore, the minimum width w can be 
reduced by reducing the focal lenth of lens 36 or 
increasing diameter of beam 28 f or both. If the focal 
length of lens 3 6 is increased, however, this will 

25 increase the length of scan line 5 0 which may be 
undesirable. If the diameter of beam 2 8 becomes 
comparable to the clear aperture of the crystal in AOD 
30, this will produce higher level sidelobes which is 
undesirable. As noted above, increased level of 

30 sidelobes will increase background signal level. 
Applicants discovered that it is preferable for the 
ratio k between the clear aperture of the crystal in the 
AOD 30 to diameter of beam 28 and 32 to exceed 1.2. 
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It is possible to increase the beam diameter 
of beam 28 and 32 by employing a long AOD crystal , while 
maintaining k to be above 1.2. However, in addition to 
cost considerations, a larger AOD crystal will cause 
5 larger losses, thereby degrading the diffraction 
efficiency of the AOD device. For this reason, it is 
desirable to employ AOD crystals that are as small as 
possible, while at the same time meeting the sensitivity 
and throughput requirements. Assuming that the beam 2 8 
10 that is entering the AOD 3 0 has a Gaussian intensity 
profile, the clear aperture of the AOD, p, satisfies: 

D = Aklv /Uw/\f , ( 1 ) 

where n is the ratio of the circumference of a circle 
to its diameter. 

Where 1 is the scan line of scan path segment 

15 50, v is the acoustic velocity in the AOD crystal 3 0, w 
is the length of the short axis of the elliptical spot 
(or the minimum width of the spot if not elliptical) on 
surface 40, Af or (f2-fl) is the bandwidth of the AOD 
- 30. The constant k is preferably in the range 1.2-5. 

20 In one embodiment, k is 1.7 and 1 is in the range of 
about 2-10 millimeters. 

Throughput Considerations 

For a semiconductor wafer inspection 
instrument to be used for wafer inspection in actual 

25 production for inspecting the entire surface of the 
wafer, throughput considerations are paramount. 
Therefore, in addition to sensitivity capability 
described above, it is also desirable for the wafer 
inspection system of this invention to have a high 

3 0 throughput. The time required for inspecting 
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semiconductor wafers first includes the time required 
for the illuminating light beam to scan the entire 
surface of the wafer. To perform the above-described 
short scan path segment scans , the time required to scan 
5 the entire surface depends on a number of factors. One 
factor obviously is the angle of illumination of the 
illuminating beam, or the value of 0, that is the angle 
between the illuminating beam and normal 150 to surface 
4 0 to be. inspected shown in Fig. 3. The larger the 

10 value of 0 (that is, the smaller the grazing angle of 
incidence), the more elongated would be the shape of the 
spot 10 in Fig. 1A, and the larger is the area being 
inspected. Another factor affecting throughput is the 
fact that the intensity distribution of the illuminating 

15 beam is typically not flat but varies, such as in the 
form of a Gaussian distribution. Therefore, the 
intensity of scattering from a location on a surface 
would depend on the intensity of the illuminating light 
at that location. In order to compensate for such 

20 variation of intensity, a number of data points are 
obtained from the scattering from the particular 
location of the surface as the spot is moved across the 
location in a manner illustrated in Fig. 6 described 
below. 

25 Fig. 6 is a schematic view of three positions 

of the illuminated area on a surface to be inspected to 
illustrate the scanning and data gathering process of 
system 20. As shown in Fig. 6, at one instant in time, 
beam 38 illuminates an area 10 on surface 40. Area or 

3 0 spot 10 is divided into sixteen pixels by grid lines xl- 
x5, yl-y5. In this context, the term "pixel" is 
meaningful only in reference to the taking of data 
samples across the intensity distribution such as that 
in Fig. IB and subsequent data processing and is 

35 borrowed from data sampling and processing in other 
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technologies such as video technology. The pixel that 
is bounded by grid lines x2, x3 and y2, y3 is pixel P 
shown as a shaded area in Fig. 6. If there is an 
anamoly in this pixel P, and if the light illuminating 
5 pixel P has the intensity distribution as shown in Fig. 
IB with a high intensity level between grid lines y2 and 
y3, light scattered by the anamoly would also have a 
high intensity. However , as the beam moves along the Y 
axis so that the area 10* is illuminated instead, pixel 

10 P is still illuminated but at a lower intensity level of 
that between grid lines yl and y2 ; in reference to Fig. 
IB, the intensity of the illumination is that between 
grid lines yl and y2 in Fig. IB. Therefore, if the 
sampling rate employed by the processor 200 in Fig. 3 

15 for processing light detected by the collection or 
collector channels 110a, 110b, 111a, 111b is such that 
a sample is taken when the illuminating beam is in 
position 10 and when the illuminating beam is in 
position 10', then two data points will be recorded. 

20 Thus if pixel P contains an anamoly, then two data 
points will be taken, one when the illumination is at a 
higher level as illustrated by data point D2 in Fig. IB 
and another one when the illumination is at a lower 
level, illustrated at data point pi in Fig. IB. If 

25 position 10 is not the starting position in the short 
scan path segment 50 illustrated in Figs. 3 and 4, then 
two prior samples would have been taken prior to the 
time -when the illuminating beam illuminates the surface 
4 0 in position 10, so that the processor would have 

3 0 obtained two more data points D3, D4 corresponding to 
the prior positions of the illuminating beam when light 
of intensity values between grid lines y3, y4 and 
between y4, y5 respectively illuminates such pixel P. 
In other words, four separate data points D1-D4 would 

3 5 have been taken of the light scattered by the anamoly in 
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pixel P as the illumination beam illuminates pixel p 
when scanning along the Y direction. 

In most laser beams, the beam intensity has a 
Gaussian distribution not only in the Y direction but 
5 also in the X direction. For this reason, after the 
illuminating beam completes the scanning operation for 
scanning a short scan path segment such as segment 50 as 
shown in Fig. 4, and when the illuminating beam returns 
to position 74 for scanning the adjacent short scan path 

10 segment 50', it is desirable for the illuminated area 
along path 50' to overlap that of scan path 50 so that 
multiple samples or data points can again be taken also 
along the X direction as well as along the Y direction. 
Spot 10 is not drawn to scale in Fig. 4 to show overlap 

15 between adjacent scan segments. Therefore, when the 
illumination beam is scanning along scan line 50' from 
starting position 74 as shown in Fig. 4, the area 
illuminated would overlap spot 10; this overlapping spot 
is 10' • as shown in Fig. 6, where the spot 10' 1 is 

20 displaced along the negative X direction relative to 
spot 10 by one quarter of the long axis of the ellipse 
10 and 10' 1 . 

As described above, the minimum width (that 
is, length of short axis) of the spots 10, 10', 10 V 1 is 

25 w. If the angle between the illuminating light beam and, 
normal 150 to the surface 40 to be inspected is 0 as 
shown in Fig. 3, then the magnitude of the long axis of 
the ellipse 10, 10', 10 f ' is w/cos6. Therefore, in 
each short scan path segment, the area illuminated 

3 0 sequentially by the illuminating light beam is given by 
(w/cos0)*l, where 1 is the length of the scan path 
segment such as 50. Thus if the radius of surface 40 is 
R and T is the time it takes for the beam to scan the 
short scan path segment, then the time it takes for the 

35 illuminating beam to scan across the entire wafer is 
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given by NlIR 2 Tcos0/lw (where the duty factor and the 
time required for illumination optics to move the beam 
between strips, such as strips 54, 56 have not been 
accounted for). In* this expression, N is the number of 
5 pixels along the X axis in each spot such as 10, 10 V, 
10' 1 , since each pixel on the surface will be 
illuminated N number of times during the scanning 
process to account for the variation of intensity of 
illumination in the X direction as described above. In 

10 the preferred embodiment illustrated in Fig. 6, where 
four data points are taken in both the X and Y 
directions , N has the value 4 . 

In the scanning process described above in 
reference to Figs. 2-4, it is noted that it will require 

15 time for the illumination optics to move the 
illumination spot between strips, such as strips 54 and 
56. If T is the time required to move the illumination 
spot between strips, then this additional time should be 
accounted for to give the total time required to scan 

20 the entire wafer surface. In the preferred embodiment 
described above, a stage 124 which includes a motor is 
used in order to move the surface so as to move the 
illumination spot from the position for scanning one 
strip on the surface to the adjacent strip as shown in 

25 Figs. 2 and 3. For a circular .. wafer of radius R, the 
spot will need to be moved 2R/r|l times between adjacent 
strips to move the spot across all the strips on the 
entire wafer, so that the additional time required is 
2RT/r)l, where I] is the duty factor (explained below). 

3 0 As known to those skilled in the art, when AOD 

3 0 is used to cause beam 38 to scan along each short 
scan path segment such as 50, time will be required at 
the beginning of the scan for the sound waves generated 
by the transducer portion of the AOD to reach the far 

35 end of the AOD crystal so as to begin deflecting the 
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beam. This is accounted for by a quantity called the 
duty factor r| given by equation 2 below, and therfore, 
the total t e it takes system 20 to scan the entire 
surface of a wafer with radius R is given by equation 3 
below: 



wnj? 2 rcos 0 + 21? x v 

t = — ( 3) 



From equation 3 above, it is evident that the 
shorter the time T to scan along a scan path segment 
such as 50, the shorter will be the time required to 
scan the entire wafer and therefore the higher the 
throughput. The time T is referred to as the chirp 
duration which also determines the data rate. The speed 
of the electronic circuit for processing the data 
ultimately sets a lower limit for the chirp duration. 

From equation 1 above, for a given spot size, 
length of the scan path segment and the value of k 7 it 
is evident that the larger the bandwidth Af or f2-fl, 
the smaller will be the clear aperture required of the 
AOD. To get maximum bandwidth from the AOD, the AOD 
should be operated at the highest possible frequency and 
one then expects to get one octave bandwidth around the 
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center frequency of the transducer. However/ the 
acoustic losses in the AOD crystal increase with the 
center frequency of operation. Large acoustic losses 
can cause two major problems: reduction in diffraction 
5 efficiency and thermal errors induced in the crystal. 
A reduction in the diffraction efficiency reduces the 
sensitivity of the system to small particles. When the 
AOD transducer is operated at high frequencies , more of 
the acoustic energy will be converted into heat which 

10 sets up thermal gradients in the AOD .crystal. Such 
thermal gradients would cause errors by degrading the 
focal spot which in turn leads to a reduction in 
sensitivity for detecting anamolies. It is therefore 
advantageous to minimize the acoustic losses by 

15 selecting as low a center frequency of the transducer as 
possible. A compromise should then be found to yield 
acceptable detection sensitivity as well as acceptable 
throughput. Applicants found that a center frequency in 
the range of 50-300 megahertz and a bandwidth preferably 

20 within the range of 50-250 megahertz would be 
acceptable. The AOD 30 is preferably driven by a linear 
frequency modulated (FM) chirp signal from generator 80 
in Fig. 2. The quantity r|l is the effective length of 
the scan path segment; in the preferred embodiment the 

25. effective length is in the range, of 2 to 10 mm but more 
preferably has a value of about 5.47 ram, where 1 has the 
value of 6.2 mm. 

From equation 3 above , it is seen that the 
larger the angle 0, the higher will be the throughput , 

3 0 since the illuminated spot will cover a larger area of 
the surface. But as noted above, the larger the spot 
size f the lower will be the sensitivity of detection. 
In the preferred embodiment, 0 is in the range of 10- 
85° and more preferably in the range of 50-80°. 
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Also from equation 3 above, it is evident that 
the larger the number of samples taken across the 
illuminated spot diameter, the more time it would take 
to scan the wafer. In the preferred embodiment, the 
5 number of samples taken across the illuminated spot 
diameter along both orthogoanl axes (X, Y ) is in the 
range of 2-10. Where four samples are taken along at 
least the X axis, N is 4 in equation 3. 

For sensitivity considerations, it is 
10 preferable for the minimum width w of the illuminated 
area to be in the range of 5-15 microns. If 0 is in 
the range of 50-80°, then the illuminating beam will 
illuminate the scan path segments such as 5 0 at such 
speed that the surface is inspected at a speed not less 
15 than about 2.5cm 2 /s, and more preferably in a range of 
about 2 .5-3 . 8cm 2 /s . 

From equation 3 above, if the time required 
for moving the wafer or the illumination beam so that 
the illuminated spot is transferred between adjacent 

2 0 strips such as strips 54, 56 is taken into account, then 

the average speed for scanning the entire surface 4 0 
will be reduced compared to that for scanning a short 
scan path segment such as segment 50. Furthermore, the 
speed for inspecting the entire wafer is further reduced 
25 because each pixel on the wafer is scanned multiple 
times as described above in reference to Fig. 6. If the 
value of T is about 0.3 seconds, and where the scan 
speed along each scan path segment is not less than 
2.5cm 2 /s, then the average speed for the illumination 

3 0 beam scanning the entire surface is not less than about 

1.5cm 2 /s. In the preferred embodiment , the average 
speed is preferably within the range of about 1.5- 
5cm 2 /s. If the surface 40 scanned has dimensions of not 
less than 200 millimeters in any direction along the 
3 5 surface, then the illumination beam will scan the entire 
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surface in about 50-90 seconds. As noted above, the 
length of the scan path segments such as segment 50 is 
preferably small compared to the dimensions of the 
surface 40 inspected. In the preferred embodiment , 
these segments are substantially in the range of about 
2-10 millimeters. 

In the preferred embodiment , generator 8 0 
supplies a linear FM chirp signal to drive the AOD so 
that the chirp duration is preferably in the range of 
20-200 microseconds , and more preferably in the range of 
about 80-120 microseconds. The beam 28 before 
deflection by the AOD 3 0 has at least one cross- 
sectional dimension (e.g. the longer dimension) in the 
range of about 4-12 millimeters. Preferably, the scan 
lens 36 is placed substantially at one focal length away 
from AOD 3 0 so that beam 3 8 scans the surface 4 0 
telecentrically . 

From the above, it will be evident that the 
objective of the invention of the high sensitivity and 
high throughput surface inspection system has been 
achieved while moderate data rate (e.g. 22 Mhz) at 
modest cost for the data sampling and processing 
electronics can still be achieved. This system is 
capable of inspecting patterned wafers with 0.35 micron 
design rule, such as patterned -wafers for 64 and 256 
megabit DRAM technology. The system is capable of 
detecting contaminant particles and pattern defects on 
memory and logic devices. With the present state-of- 
the-art robotic implementation for removing and 
replacing wafer 4 0 on stage 124 ready for system 2 0 to 
inspect and the inherent delay (about 25 seconds /waf er ) 
involved therein, system 20 described above is capable 
of inspecting in excess of about 40 wafers per hour for 
150 millimeter diameter wafers (6-inch wafers), in 
excess of about 20 wafers per hour for 200 millimeter 
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diameter wafers (8-inch wafers) and in excess of about 
10 wafers per hour for 300 millimeter diameter wafers 
(12 -inch wafers) . 

Figs, 7A, 7B are side views showing the 
5 polarization schemes employed by the present invention. 
It is found that by employing certain polarization 
schemes , the signal-to-background of the system can be 
substantially improved. The polarization scheme 
employed may be surface-dependent and may also be used 

10 to determine the composition of the anamoly (such as 
metallic as opposed to dielectric material) • For 
pattern defects/ the polarization elements included in 
the post-AOD and polarization selection optics 34 of 
Fig. 2 faces the illumination beam in a state of either 

15 P or S polarization. Fig. 7A illustrates the situation 
where the illumination beam 214 is in a state of S 
polarization where the electrical field E is 
perpendicular to the plane of incidence which is defined 
by the incident beam 214 and the specularly reflected 

20 beam 214a; this plane of incidence is parallel to the 
plane of the paper. A vector representation of the beam 
is shown by a k vector representing the direction of 
propagation. The magnetic field is shown as the H 
vector. The electric field vector is shown as being 

25 perpendicular to the plane of incidence by representing 
it with a dot and labeled E. In Fig. 7B, the beam 214 
is in a state of P polarization where the electric field 
E is in the plane of incidence and the plane of the 
paper. In Fig. 7B the beam 214 is shown in vector form 

3 0 with a propagation vector k, a magnetic field vector 
shown as a dot labeled H where the electric field vector 
E is perpendicular to the propagation vector k. Instead 
of P or S polarization states, the illumination beam can 
also have a left or righthanded circular polarization. 

3 5 Where the polarization state of the illumination beam is 
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chosen to optimize signal-to-noise background and for 
defect detection, the collector or collection channels 
include polarization filters to pass light of 
predetermined polarization states to enhance detection 
5 capability and signal-to-noise ratio. 

While in the invention described above , the 
scan path segments are described and illustrated as 
straight lines , it will be understood that it is also 
possible for curved scan lines to be employed, such as 

10 where the wafer is rotated about an axis instead of 
translated along straight lines in the X and Y 
directions as described above. While in the preferred 
embodiment described above, the short scan path segments 
form arrays, each array covering a substantially 

15 rectangular strip of the wafer, it will be understood 
that other different arrangements of the scan path 
segments are possible for covering the entire or 
substantially the entire surface 40; such and other 
variations are within the scope of the invention. As 

2 0 the spot 10 approaches the edge of surface 40, the 

length of the scan path segment may be reduced so that 
the spot does not fall outside surface 40. All the 
advantages described are obtained even though the 
segments are of different lengths if each of at least 
25 some of the segments has a span shorter than the 
dimensions of the surface. Also, the AOD 30 may be 
replaced by a polygonal scanner or galvonometer . While 
the invention has been described by reference to 
preferred embodiments, it will be understood that 

3 0 modifications and changes can be made without departing 

from the scope of the invention which is to be defined 
only by the appended claims. 



